Model Selection

End-to-end speech synthesis

# End-to-end speech synthesis

A Vietnamese text-to-speech model developed by Meta, based on the VITS architecture, supporting high-quality speech synthesis

Speech Synthesis

Kinyarwandatts Female Voice

This is an end-to-end deep learning based Kinyarwanda text-to-speech (TTS) system, trained using Coqui's TTS library and YourTTS architecture.

Speech Synthesis

Transformers Other

DigitalUmuganda

Nepali male voice synthesis model based on VITS architecture, supporting high-quality text-to-speech functionality

Speech Synthesis

Transformers Other

Mms Tts Div Finetuned Md F01

This is a Transformer-based text-to-speech (TTS) model that supports Dhivehi language speech synthesis.

Speech Synthesis

Transformers Other

VITS is an end-to-end text-to-speech model based on adversarial learning and conditional variational autoencoder, supporting Chinese speech synthesis.

Speech Synthesis

Transformers Chinese

Marshallese text-to-speech model developed by Meta, using VITS end-to-end architecture to support high-quality speech synthesis

Speech Synthesis

Mms Tts Cmo Script Khmer

A Central Mnong text-to-speech model developed by Meta, supporting conversion of text to natural speech

Speech Synthesis

Mossi text-to-speech model developed by Meta, based on VITS architecture, supporting end-to-end speech synthesis

Speech Synthesis

Mms Tts Cak Dialect Southcentral

A Kaqchikel (South Central dialect) text-to-speech model developed by Meta, which is part of the MMS project and supports speech synthesis in multiple languages.

Speech Synthesis

The Runyoro text-to-speech model developed by Meta, which is part of the Massive Multilingual Speech (MMS) project

Speech Synthesis

A Text-to-Speech model for the Triqui (Chicahuaxtla dialect) developed by Meta, which is part of the Massive Multilingual Speech (MMS) project.

Speech Synthesis

The Lampung Api text-to-speech model developed by Meta, which is part of the MMS multilingual speech project

Speech Synthesis

A text-to-speech model for Trinidadian Creole developed by Meta, which uses the VITS architecture to achieve high-quality speech synthesis.

Speech Synthesis

A Chin and Bam text-to-speech model developed by Meta, part of the Massively Multilingual Speech (MMS) project.

Speech Synthesis

Shona text-to-speech model from Facebook's MMS project, implementing high-quality speech synthesis based on VITS architecture

Speech Synthesis

Samoan text-to-speech model developed by Meta, based on the VITS architecture, supporting high-quality speech synthesis

Speech Synthesis

A Kyrgyz text-to-speech model developed by Meta, based on the VITS architecture, supporting high-quality speech synthesis.

Speech Synthesis

Khmer text-to-speech model from Facebook's MMS project, implemented with VITS architecture for end-to-end speech synthesis

Speech Synthesis

A Bambara text-to-speech model developed by Meta, part of the Massively Multilingual Speech project, utilizing the VITS architecture for high-quality speech synthesis.

Speech Synthesis

A Shan text-to-speech model developed by Meta, part of the MMS project, supporting conversion of Shan text to natural speech.

Speech Synthesis

Mms Tts Azj Script Latin

North Azerbaijani (Latin script) text-to-speech model developed by Meta, part of the Massively Multilingual Speech project

Speech Synthesis

Pangasinan text-to-speech model developed by Meta, based on VITS architecture, supporting high-quality speech synthesis

Speech Synthesis

Oromo text-to-speech model from Facebook's MMS project, implementing end-to-end speech synthesis based on VITS architecture

Speech Synthesis

Kazakh text-to-speech model developed by Meta, based on the VITS architecture, supporting high-quality speech synthesis

Speech Synthesis

A Sango text-to-speech model developed by Meta, based on the VITS architecture, supporting high-quality speech synthesis

Speech Synthesis

Garhwali text-to-speech model developed by Meta, supporting high-quality speech synthesis

Speech Synthesis

VITS architecture Lao TTS model developed by Meta, supporting end-to-end speech synthesis

Speech Synthesis

Amharic text-to-speech model developed by Meta, based on the VITS architecture, supporting high-quality speech synthesis

Speech Synthesis

A Fula text-to-speech model developed by Meta as part of the Massively Multilingual Speech project, supporting the conversion of Fula text into natural speech.

Speech Synthesis

A Finnish text-to-speech model developed by Facebook, based on the VITS architecture, supporting high-quality Finnish speech synthesis.

Speech Synthesis

Indonesian text-to-speech model from Facebook's MMS project, implemented with VITS architecture for end-to-end speech synthesis

Speech Synthesis

Akan text-to-speech model developed by Facebook, based on VITS architecture, supporting high-quality speech synthesis.

Speech Synthesis

Swahili text-to-speech model developed by Meta, based on VITS architecture, supporting high-quality speech synthesis

Speech Synthesis

Southern Min text-to-speech model released by Meta, based on VITS architecture, supporting high-quality speech synthesis

Speech Synthesis

Malayalam text-to-speech model in Facebook's MMS project, implementing end-to-end speech synthesis based on VITS architecture

Speech Synthesis

Haitian Creole text-to-speech model developed by Meta, part of the Massively Multilingual Speech (MMS) project

Speech Synthesis

Portuguese text-to-speech model from Facebook's MMS project, implementing high-quality speech synthesis based on the VITS architecture

Speech Synthesis

VITS is an end-to-end speech synthesis model capable of predicting corresponding speech waveforms from input text sequences. The model employs a conditional variational autoencoder (VAE) architecture, including a posterior encoder, decoder, and conditional prior module.

Speech Synthesis

kakao-enterprise

VITS is an end-to-end speech synthesis model capable of predicting corresponding speech waveforms from input text sequences.

Speech Synthesis

kakao-enterprise

Featured Recommended AI Models

AIbase

Empowering the Future, Your AI Solution Knowledge Base

English 简体中文繁體中文にほんご

© 2025AIbase